MMSS : graph-based multi-modal story-oriented video summarization and retrieval
نویسندگان
چکیده
We propose multi-modal story-oriented video summarization (MMSS) which, unlike previous works that use fine-tuned, domain-specific heuristics, provides a domain-independent, graph-based framework. MMSS uncovers correlations between information of different modalities and gives meaningful story-oriented news video summaries. MMSS can also be applied for video retrieval, achieving performance that matches the best traditional retrieval techniques (OKAPI and LSI), with no fine-tuned heuristics such as tf/idf.
منابع مشابه
Multi-modal Video Summarization Using Hidden Markov Models for Content-based Multimedia Indexing
MULTI-MODAL VIDEO SUMMARIZATION USING HIDDEN MARKOV MODELS FOR CONTENT-BASED MULTIMEDIA INDEXING Yaşaroğlu, Yağız MSc., Department of Electrical and Electronics Engineering Supervisor: Associate Professor A. Aydın Alatan September 2003, 75 pages This thesis deals with scene level summarization of story-based videos. Two different approaches for story-based video summarization are investigated. ...
متن کاملTREC 2003 Video Retrieval and Story Segmentation Task at NUS PRIS
This paper describes the details of our systems for story segmentation task and search task of the TREC-2003 Video Track. In story segmentation task, we propose a two-level multi-modal framework. First we analyze the video at the shot level using a variety of low and high-level features, and classify the shots into pre-defined categories using a Decision Tree. Next we perform HMM analysis in or...
متن کاملGraph Hybrid Summarization
One solution to process and analysis of massive graphs is summarization. Generating a high quality summary is the main challenge of graph summarization. In the aims of generating a summary with a better quality for a given attributed graph, both structural and attribute similarities must be considered. There are two measures named density and entropy to evaluate the quality of structural and at...
متن کاملDiscovery and fusion of salient multimodal features toward news story segmentation
In this paper, we present our new results in news video story segmentation and classification in the context of TRECVID video retrieval benchmarking event 2003. We applied and extended the Maximum Entropy statistical model to effectively fuse diverse features from multiple levels and modalities, including visual, audio, and text. We have included various features such as motion, face, music/spe...
متن کاملDiscovery and Fusion of Salient Multi-modal Features towards News Story Segmentation
In this paper, we present our new results in news video story segmentation and classification in the context of TRECVID video retrieval benchmarking event 2003. We applied and extended the Maximum Entropy statistical model to effectively fuse diverse features from multiple levels and modalities, including visual, audio, and text. We have included various features such as motion, face, music/spe...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2015